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Abstract. Some of the principal heliophysicaQinferences that have been drawn 
from, or refined by, seismology, and the manner in which those inferences have been 
made, are very briefly described. Prominence is given to the use of simple formulae, 
■ derived either from simple toy models or from asymptotic approximations to more re- 

alistic situations, for tailoring procedures to be used for analysing observations in such 
a way as to answer specific questions about physics. It is emphasized that precision is 
not accuracy, and that confusing the two can be quite misleading. 



1. Prelude 



We have learnt a great deal about the interior of the sun since helioseismology, in the 
form that we know it, began some 36 years ago. I now take stock of the situation, 
in an attempt to provide some foundation for asteroseismology, which is already well 
under way. This is not an attempt to provide a history of the subject, but is instead a 
few remarks, often in a cautionary vein, about how one goes about assessing inferences 
from seismic frequency data. I shall accommodate what I have to say within a selection 
of a few investigations that have taught us physics. The details of those investigations 
do not necessarily apply without modification to other stars, because the data available, 
both seismic and otherwise, are not of the same kind. However, in many cases the broad 
principles behind what I say remain pertinent. 

It is not inappropriate to start by describing the first helioseismological i nference. 
It came about from the production of a k-cj diagram for high-degree modes by lDeubrier 



'The term heliophysics was coined in 1981 to denote the physics of the entire sun, out to the corona. 
It is a direct translation from the French 'heliophysique', which was introduced to provide a distinction 
from physique solaire (solar phys ics) which in practice was then confi ned to only the outer layers. It is a 
subdiscipline of heliology (cf. Christensen-Dalsgaard & Gough 1976). Recently the meaning of the term 
has been extended to include the physics of the heliosphere (the space around the sun, in principle out to 
the shock where the solar wind encounters the interstellar medium, but excluding the planets and other 
condensed bodies). Here I shall confine my remarks within the original meaning. 
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(119751) . lAndo & Osakil (119751) had already carefully computed a relation from the os- 
cillations of a model of the convection zone, obtaining results very similar to Deub- 
ner's observations. But it was evident that the theoretical frequencies a>(k) were sys- 
tematically somewhat too high. In order to revise them downwards it was nec essary 



to pro duce a model convection zone with a lower adiabatic 'constant' p/p ri (IGough 



1977), implying a lower entropy for a given chemical composition, which requires a 



lesser mean value of the parameter Ti := dln/?/dlnp defining the seismic stratification 
through the upper superadiabatic boundary layer. The magnitude of the lessening was 
estimated, quite crudely, from the analytical dispersion relation for acoustic-gravity 
waves in a plane-parallel polytrope, and was seen to imply, from an earlier analysis of 
the influence of the int egral properties o f that layer on the overall stratification of the 
convection zone (IGough & Weissl H976I) . that the convection zone is about 200 Mm 
deep, some 30 per cent deeper than it was fashionable to contemplate at the time. 
So on afterwards this result was confirmed by more precise computations undertaken 
by lUlrich & Rhodes (Il977h . 



I tell this story because it illustrates a basic principle that has been used many 
times subsequently in helioseismology: that to assess the broad implications of a small 
discrepancy between theory and observation it is adequate - indeed expedient, in view 
of its simplicity - to use at first a very rough description of the possible cause. Of 
course, more precise - usually numerical - analysis is required subsequently in order 
to quantify the adjustments that must be made to the original reference model, as did 
Ulrich and Rhodes in the case I am illustrating here. In view of the smallness of the 
discrepancies between the theoretical eigenfrequencies and those observed, it is usually 
adequate to presume smallness of the structural adjustment to the theoretical model that 
is required to remove these discrepancies, and thus for most purposes it is adequate to 
perform a linearized perturbation analysis. This is the basis of most inversions, which 
I discuss in the next section. However, some investigators prefer to compare the full 
frequencies of entire solar models. 

I have found the analytical outcome from simple toy models to be extremely useful 
in exhibiting how properties of the sun depend on details of the representations of the 
physical processes one is considering. A prime example is a simplistic approximation 
to the main-sequence evolution of the solar luminosity, L(t), showing how it depend s 
on mass-loss rate and a putative variation in the constant of gravity G ( Gough| [r990b). 



The formula enables one immediately to determine, for example, the mass-loss rate that 
renders the luminosity almost constant, a desire amongst climatologists in those days 
when their theories were incapable of accommodating the inevitable rise in L of mass- 
preserving stars. It summarized the publis hed numerical investigat ions of the day, and, 
I trust, similar investigations of today (e.g. lGuzik & Mussackll2010l) . although how well 
it reproduces the latter has not yet been tested. 

An equally valid approach, available only to those with the requisite machinery, 
is to survey parameter space numerically, recording how various salient properties of 
solar models respond to changes in the ph ysi cs. In a very valuable series of papers 



sota r modets respond t o changes in tne pn ysi cs. in a very valuable series ol papers 
Christensen-Dalsgaardl (Il988b0l99llll996h and lTripathv & Christensen-Dalsgaardl (Il998h 



have published the results of an extensive study in sufficient detail for readers to appre- 
ciate how the properties of solar models respond to changes in initial conditions or 
to assumptions in the physics on which they depend, and to be able to estimate par- 
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tial derivatives and thereby carry out multi-parameter investigations for themselves. In 
particular, it permitted one to appreciate immediately the imp lications of the heavy- 
element-abundance revision proposed by lAsplund et all d2005h . a matter to which I 
shall return later. 



A comment related to the implications of the revision in the depth of the convec- 
tion zone is perhaps not out of place. Within the framework of standard stellar evolution 
theory, a deep convection zone at the canonical solar age could be achieved only with 
a heavy-element abundance, and a consequent initial helium abundance, rather higher 
than was preferred at the time. That implied a relatively high neutrino flux, which exac- 
erbated the solar neutrino problem, and heralded the role of seismology in establishing 
that the solution to the problem must lie in nuclear or particle physics. 



In 1970 Fred Hoyle, my Director at the Institute of Theoretical Astronomy, as it 
was then called, asked me to compute neutrino fluxes from solar models with gravita- 
tional settling of heavy elements. I had never before computed a full stellar model, so I 
resorted to modifying an existing evolution programme which Bohdan Paczynski gave 
me, adding neutrino production and gravitational settling, the latter rather simplistically 
by the standards of today. The objective was to determine whether gravitational settling 
reduces neutrino production. Intuition was not extensive enough to predict the outcome 
in advance, because almost all intuition of stellar evolution at that time was based on 
the initial-value problem: how the structure and evolution of a star whose radius and 
luminosity, say, are prescribed at t - responds to a modification to the representation 
of physical processes. Instead, the solar problem is a final-value problem, enquiring of 
responses of models whose radii and luminosities are prescribed at at t = t©; in practice 
they are computed by iterating the initial conditions in a series of forward computations. 
As is now well appreciated, the adjustments at t - induce reactions in the opposite 
sense to those of the modification to the ph ysics, and it therefore take s more careful 
thinking to predict the eventual outcome (cf . iFaulkner & Swensonl ll988). Fred was un- 
happy wi th my results because the neutrino flux increased, for reasons that we now un- 
derstand (IGoughll2003r) . and the work proceeded no further. But I tell the story because 
it taught me a lesson in science which I wish to pass on to those not yet experienced 
enough to have discovered it for themselves. My principal difficulty in carrying out the 
computations had been to adopt appropriate values for physical quantities arising in the 
theory, such as cross-sections for the nuclear reactions in the p-p chains: a diversity 
of values were scattered throughout the literature, and, fortunately, they had not yet 
been assembled and assessed (and improved). So I tried a variety, obtaining neutrino 
fluxes scattered wildly about a value of 20 snu , a value sim ilar to, although somewhat 
lower than, the value promulgated by IBahcall J 1964 L 19661) bef o re Homestake . Then , 
when almost no neutrinos were detected by Davis et al.1 (1 19681) . IBahcall et all (fl968) 
used 'better' nuclear data and produced a best theoretical value of about 7 snu (still un- 
comfortably high). When I looked at my results I found that the lowest of my fluxes 
were near 7 snu too, and I could go no lower. That was evidently why the neutrino 
issue became a 'problem'. What I learned from the exercise is the manner in which 
values for uncertain quantities appear to be selected to produce a 'best' model. Perhaps 
appearances are deceptive, but in order to experience them, when it comes to compli- 
cated numerical computations, it is necessary to get one's hands dirty by repeating the 
calculations oneself in order to appreciated the import of published conclusions. 
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Even though it has been stressed more than once before, it is still worth stress- 
ing again that the oscillation frequencies depend basically on only what I call seismic 
variables: principally pressure, p, exerting a force on material with inertia density, p, 
together with a quantity that relates a (Lagrangian) change 5p to the perturbation Sp that 
causes it. The most convenient quantity to adopt for that relation is typically the adia- 
batic exponent y\ = (dlnp/dlnp) s , the partial derivative being taken at constant specific 
entropy s. In a first approximation in which the sun is regarded as being spherically 
symmetrical, pressure and density are related directly by hydrostatic balance, so only 
one of them is required to specify the seismic stratification. It is important to appreci- 
ate that hydrostatic balance does not depend explicitly on y\ , so from a seismological 
point of view y\ can be regarded as being independent of p and p, although of course 
it must lie within the bounds dictated by thermodynamics. Any function of p, p and j\ 
is also a seismic variable - most common is the adiabatic sound speed c = VCTiWp) ~~ 
and a representation of the seismic variables that is consistent with the seismic data is 
called a seismic model. Of course only two independent seismic variables are required 
to specify the seismic structure completely. 

The seismic data that I have in mind in this discussion are the oscillation frequen- 
cies of normal modes, for it is they that have been used the most extensively, and they 
that are the most pertinent to asteroseismology. I should point out that I appreciate 
that the magnetic field B is also a seismic variable, as also is the angular velocity SI. 
Unfortunately, it appears not to be possible to isolate inferences about B from infer- 
ences about p and p by seismic frequency analysis alone, because for any stellar model 
with a given B there exists an isospectral model with B = and an appropriately dif- 
ferent c; any resulting degeneracy splitting can be rep resented, at least asymptotically, 
by a s uitable aspherical sound-speed perturbation (e.g. lGoughlll993UZweibel & Gough 



1995). Therefore there is always ambiguity. The eigenfunctions are different, how- 
ever, although nobody has yet succeeded in detecting and identifying that difference 
unambiguously in solar observations: to make inferences about B otherwise requires 
additional, non-seismic, information. (Rotation is different, because the distinction be- 
tween east and west is manifest as a frequency perturbation with a component that is 
an odd function of azimuthal order m, which a magnetic field or a sound-speed pertur- 
bation cannot induce.) I hasten to add that the inferences that we have made about the 
seismic structure, such as those illustrated in Figure 1, do depend also on some non- 
seismic information, namely the values of the seismic radius R, which in practice is 
related to the photospheric radius via modelling, and the total mass M of the sun. The 
seismic radius is the radius at which the variables p and p, if they were to be extrap- 
olated appropriately from the upper layers of the adiabatically stratified region of the 
convection zone, would appear to vanish. I have implicitly assumed in my discussion 
that the seismic motion is adiabatic. That is largely true, although not in the immedi- 
ately subphotospheric regions and the atmosphere immediately above: the physics in 
the outer turbulent layers of the sun is ill understood, although fortunately some aspects 
of the seismic data using modes over a (wider) range of degree / can be used to elimi- 
nate the uncertainty. However, there is much more work that could be done to improve 
our understanding of the surface, a task which, in my opinion, is urgently required for 
studying the properties of other stars whose oscillations can be observed at only a few 
low values of I, and for which spatially resolved observations similar to those that have 
been made on the sun will never be available. Consequently the effect of the surface 
layers cannot be unambiguously removed. 
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Figure 1 . Optimally localized averages of relative differences between the squared 
sound speed and the density in the sun and in Model S of Christensen-Dalsgaard et 
al. (1996), computed by M. Takata from MDI 360-day data and plotted against 
the centres x = f/R of the averaging kernels A(x; x), which here resemble Gaus- 
sian functions, and are defined by x = J xA 2 dx / J A 2 Ax. The length of each hor- 
izontal bar is twice the spread s of the corresponding averaging kernel, defined as 
s= 12 j(x-x) 2 A 2 dx - an averaging kernel A that is well represented by a Gaussian 
function of variance A 2 has spread approximately 1.7A 0.72FWHM; were it to 
be a top-hat function, its spread would be the full width, which is why s is so de- 
fined. The vertical bars extend to ± one standard deviation of the errors, computed 
from the frequency errors quoted by the observers assuming them to be statistically 
independent. 



2. On interpreting inversions of oscillation frequencies 



Inversions to determine the seismic structure of the sun are usually carried out by ref- 
erence to a theoretical model that is sufficiently close to the sun for linearization in the 
difference to be a good approximation. Here, for simplicity, I assume the sun to be 
spherically symmetrical, and I represent the structure by two independent seismic vari- 
ables yi(x) and y>2(x), where x = r/R, r being a radial coordinate. (The procedures can 
easily be generalized in an obvious way to obtain information about the asymmetric 
component of the structure). Then one can write, for mode (n, I) of order n and degree 
I: 

Sco n4 ^ f Kf' l) Sln yi dx+ f K^Shiyzdx + nfOnj)/^ , W 
Jo Jo 

where I (n ^ is the inertia of the mode, normalized at x = 1 , and 6 denotes the difference 
between the sun and the model. The objective is to express (51ny, in terms of the obser- 
vations 6co„j. The kernels ^ and K2 depend on y\ and V2 and the eigenfunctions of the 
mode in question, but not on 8\ny\ and 51nv2- They can be obtained either by perturb- 
ing an integral formula for u) n j that constitutes a variational principle, if the boundary 
conditions adopted are such that the system is self-adjoint (the differential operators in 
the governing equations can be written in self-adjoint form), or, if not, by carrying out a 
non-singular perturbation expansion of the governing differential system and obtaining 
Sco n j from the condition that a solution exists. 
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The function P(oj) is largely unknown, and was introduced as an acknowledge- 
ment that the physics, and therefore the governing differential equations, are not certain 
in and above the turbulent boundary layer at the top of the convection zone; it might 
also contain surface integrals which arise when the boundary conditions are such that 
the system is not self-adjoint. P is a function of u alone if the region of uncertainty is 
thin enough (requiring / to be low enough) for the / dependence of the oscillation to be 
negligible in that region; otherwise it depen ds also onw = co/L, where L = I + 1 /2, and 
can usefully be expanded in powers of w~ 2 dGough & Vorontsovll 19951) . 



The outcome of an inversion is a sequence of estimated spatial averages Slnyy and 
£lny2 of (51nyi and 51ny2 weighted by unimodular (i.e. having unit integral over the do- 
main of existence, here < x < 1) averaging kernels A\(x; x) and A2(x; x) respectively, 
each of which is a linear combination of K^(x) or K^' l \x) with coefficients Cj or 

which depend on x. One would normally like the averaging kernels to be well 
localized about x = x, for then the averages, which are represented by the same com- 
bination of the frequency differences as are the averaging kernels, are relatively easy 
to interpret: plotted as a function of x, they are essentially a blurred view of c>lnyi and 
£lny2, at least when x represents the (actual) centre x of localization of A\ or A2. (There 
are some who regard them as tracing the actual functions £lnyi and 6lny2', that would 
not be a bad approximation if A\ and A2 were very well localized, which can be the case 
within some ranges of the independent variable x.) It should not be necessary to know 
how the averaging kernels were constructed in order to interpret the inversions; but it 
is necessary to appreciate that the final outcome is not uncontaminated: the average of 
£lnyi, say, is actually 

51nyi := I A[(x; x) 51nyi dx - ^ c^(x) 6co n j + Hidlnyj) , (2) 
J n,l 

where Ay = Dn,/ C i ^1 an(1 tne residual is given by 

* = ~ f 1 S cf\x)Kf l) S[ny 2 dx - V cf\x)P{aj n j) 1 1^ . (3) 



The corresponding expression for 5\ny2 is similar. It is evident that, in addition to 
obtaining a localized averaging function A 1, it is desirable also to reduce the magnitude 
of % to a (practical) minimum, in order to obtain the best approximation to the average 
in terms of the the data by ignoring H on the right-hand side of equation ©. To judge 
the outcome one needs some information about the degree to which that reduction has 
been achieved; such information is rarely available. One also needs to be informed 
of the characteristic range of x over which the averaging is taken, and the estimated 
uncertainty (standard error) of the value of the average that results directly from errors 
in the data. That information is usually provided by horizontal and vertical bars, such as 
those in Figure 1, which respectively represent a characteristic width of the averaging 
kernels and the standard deviation of the value of the approximated average resulting 
from the standard errors in the data. It would be useful also be given an idea of the 
shapes of the averaging kernels; some have significant side-lobes far from x, typically 
near the sun's surface, demanding some care in interpreting those averages, and possible 
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only if the side-lobe structure is known. One also needs to know how x is defined. If 
the averaging kernel is well localized and x is some representation of the location of 
its (suitably defined) centre, then the precise definition is not very important. But if 
x is merely the value about which the author had tried to locate the averaging kernel, 
which unfortunately is sometimes the case, then other kinds of information are needed 
for interpreting the published results. 

A common approach to inversion is to try to construct well localized kernels ex- 
plicitly: a procedure called OLA (optimally localized averaging). Greater localization 
usually results in greater error in the averages arising from errors in the frequency data, 
because increasing localization increases the magnitudes of the coefficients c. ; more 
drastic cancellation arises in the sum of the actual frequency differences on the right- 
hand side of equation (O, but that is not shared by the random measurement errors. So 
a compromise must be made. Just how that compromise is made depends on the judge- 
ment of the inverter. Once 51ny; have been estimated, one can iterate by estimating 51ny2 
from its average £lny2 (which necessarily requires the adoption of assumptions, such as 
smoothness, and possibly some prejudices gleaned from one's experience with theoret- 
ical models), from which H can then be estimated and incorporated into the constraint 
©■ 

Another approach is to try instead to fit the data optimally, using parametrized 
representations of (Jlnyi and (51ny2, typically expressed as linear combinations of preas- 
signed basis functions with coefficients chosen such as to minimize by weighted least 
squares the differences 6co n j given by equation ©. The outcome is a linear combina- 
tion of frequencies from which averaging kernels A\ and A 2 can be constructed. As 
with OLA, the procedure must be regularized, usually to favour smoothness, to prevent 
excessive cancellation and consequent excessive error magnification. But, unlike OLA, 
both functions dlnyi and 51ny2 are automatically accounted for simultaneously. This 
regularized least-squares (data-)fitting (RLSF) method is usually abbreviated as RLS. 
When it is used, rather than plotting the averages weighted with A\ and A2, whose 
side-lobes are invariably worse than those explicitly designed by OLA to abhor them, 
it is common merely to plot the parametrized representations of 51nyi and 51ny2 that re- 
sult. These are but a single example of the infinite set of functions that (approximately) 
satisfy the data. 

Inversions are sometimes carried out using asymptotic approximations to the eigen- 
functions that yield equations which, after appropriately smoothing the data, can be 
formally inverted analytically (although the final result must be evaluated numerically) 
to yield a smoothed representation of the seismic structure. The procedure has the ad- 
vantage of being simple and fast, and, because the product is an explicit integral, one 
is readily able to appreciate how certain features in the data relate to features in the 
seismic structure. Moreover, it does not rely on a reference theoretical model of the 
sun. The outcome is a nonlinear combination of the frequencies, so simple averaging 
kernels are not available. Inversions such as these are often criticized for being less 
precise than OLA or RLSF; that they are less precise is indeed often the case, but, 
by not depending on prejudices such as those upon which any reference model must, 
they are not necessarily less accurate. That remark applies particularly to inversions 
for structure, which actually depend in a nonlinear way on the functions being sought 
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Figure 2. Three difFerent sets of optimally localized averages of the relative dif- 
ferences between the squared sound speed and the density of the sun and Model S of 
Christensen-Dal sgaard et al . (199 6), all inferred from the so-called 'best frequency 
set' selected by Ba su et all d!997l) . The vertical bars represent ± one standard de- 
viation of the propagated frequency errors, assuming those errors to be statistically 
independent. Characteristic kernel widths have not been drawn for fear of unduly 
cluttering the diagram (courtesy G. Houdek). 



(and, under some circumstances, in consequence require iteration); it does not apply to 
inversions for angular velocity Q, because the sun's rotation is dynamically weak, and 
the dependence of the eigenfrequencies on Q. can be linearized, leading to relations of 
the type (Q} (with y\ = Q, and y>2 = 0) with kernels K^"'® that do not depend on Q. It is 
worth mentioning that it can be convenient to apply asymptotic methods directly to the 
inversion of the linearized constraints (Q]) for the structure too, for there it is only the 
small difference between the sun and the reference model that is being approximated; 
as I advocated in the first paragraph of the prelude, it is often expedient to start an in- 
vestigation with a simple quick analysis, and in some circumstances that analysis can 
even achieve adequate precision for the purpose in hand, with the added advantage of 
a partially analytical appreciation of how the outcome depends on the data. Examples 
of optimally localized averages of sound-speed differences between the sun and prob- 
ably the best reference solar model at our disposal, namely Christensen-Dalsgaard's 
Model S, now an improved versio n of the model with the same name discussed by 
dChristense n-Dalsgaard et al.lll996l) . are presented in Figure 2. They were obtained 



with the same data by several different inverters. 

Despite the abscissa, here labelled r/R, not being defined, and likely not being the 
same for the different plots, the most striking feature is that the differences between 
the differences exceed the quoted errors by a substantial margin: the accuracy of the 
results, as naively suggested by the figure, appears to be significantly less than the 
precision. It must be appreciated, however, that that is actually not the case, assuming 
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Figure 3. The dashed line is the relative difference between the sound speed in 
a stan dard solar model computed by Bahcall and Pinsonneault and that inferred by 
Basu (Bas u et aT]ll997|) fro m a combination of osc illation frequency data ob tained 
by lChaplin et aljdl996f) and lTomczyk et all (Il995allbh (from lBahcall et alJl200ll) . 



that the inverters have not made a technical computational error, which I'm sure they 
have not. What must be the case is that the averaging kernels are rather different, and 
therefore so are the quantities plotted. One might also note that the tradeoff between 
kernel spread and error magnification appears to be different from that adopted for 
Figure 1; the errors are smaller here, suggesting broader kernels, which is consistent 

with the tachocline anomaly - the hump in Sc 2 /c 2 immediately beneath the base of 
the convection zone which could have resulte d from homogeni zation in the tachocline 
and which is probably too thin to be resolved (Ell iott etail 19981) - being broader, 
though the data set employed here is different. 



even 



An example of the sound speed in another solar model is illustrated in Figure 3. 
The smallness of the discrepancy has been employed often by Bahcall, who, by plot- 
ting the error in the theoretical model rather than the solar sound speed relative to a 
reference model, presumably, and quit e correctly, tru sted the seismology more than his 
modelling. Indeed , he had co ncluded (Bahca llll200ll) that the ability to a djust standard 
solar models (e.g. iBrun et al.|[l999l 120001 : iBahcall & Pinsonneault! l2000h to bring their 
neutrino fluxes within a mere 20 % or so of tho s e mea sure d at SuperKam iokande and the 
Sudbury Neutrino Observatory dAhmad et all (1200 ll) . cf. lAhmad et"ai1 (120021) ) demon- 
strates 'a triumph for the theory of stellar evolution ... which is a cause for rejoicing 
among astronomers'. 
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3. Macroscopic Physics 



The most fertile arena of macroscopic physics has arisen from studies of kinematic^ 
The most plentiful, and the most reliable, are measurements of rotation. The reason is 
partly that the signature in the oscillation frequencies is greater than those from other 
components of flow, but principally because it is only rotation that is uniquely identifi- 
able: by the property that eastward and westward (azimuthally averaged) macroscopic 
motion would split the degeneracy of the seismic frequencies with respect to azimuthal 
order m in opposite senses. The splitting is caused by both advection and a Coriolis 
force, and is a function that has an expansion in powers of m with non-zero coefficients 
of both odd and even powers. An expansion of frequency splitting produced by any 
other (aspherical) perturbing agent contains only even powers of m. 

Plots of the time-averaged angular- velocity averages H(r, 9) are presented in other 
contributions to these proceedings, so I refrain from doing so too. The dominant fea- 
tures are the (principally latitudinal) differential rotation in the convection zone, the al- 
most (perhaps exactly) uniform rotation in almost all of the radiative interior, together 
with the thin tachocline separating the two. The tachocline is now amongst the most 
active arenas of heliophysical research (e.g. iHughes et all 120070 . The rate of rotation 
of the radiative zone is such that the spherically averaged linear velocity suffers a 7% 
decline at the equator going downwards across the tachocline. 

That the angular velocity deep in the interior is not much greater t han that ob 



serve d at the surface was realized in the early days of helioseismology (IDuvall et al. 



1984). That was a surprise to many theorists. It had generally been believed that, as a 
result of the retarding torque applied via Maxwell stresses to the sun by the solar wind, 
the surface layers must now be rotati ng substantial ly more slowly than the deep interior; 
debate was about only by how much. iDickd dl964l) . for example, maintained steadfastly 
that the difference is substantial, j ustifying his position with his surfa ce oblateness mea- 
surements with Goldenberg (see iDicke & Goldenbergl 1 1967L Il974l) . and arguing that 
viscous stresses in the radiative interior were insufficient to tie the core to the surface. 



He used the assertion to support his theory of gravity with Brans ( Brans & Dickdll961 ): 



very roughly speaking the theory regarded Newton's constant G not as a true constant, 
but as a field which satisfied an inhomogeneous wave equation fed by matter - gravity 
was thereby less tightly connected to matter, and therefore the precession of planetary 
orbits, for example, must be slower than what had been predicted by General Relativ- 
ity; appropriate rapid rotation of the solar interior was therefore required to distort the 
external gravitational field enough to make up for the planetary precessional loss. The 
helioseismological finding destroyed that argument. 

Dicke's weak-spin-down claim trig gered more cogent flu i d-dyn amical discussion, 
principally by Howard et al.1 (1 19671) and [Bremerton & Spiegel ( 1968), who pointed out 
that the baroclinicity induced by the differential rotation of spin-down drives a global 



2 Invariably, in the present context, called dynamics, although no direct helioseismological measurement of 
a force has ever been made. However, the kinematical inferences have spawned a great deal of theoretical 
research into the dynamics. 
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meridional flow which transports (negative) angular momentum downwards from the 
convection zone, thereby slowing the core much more quickly. The process was likened 
to that operating in a stirred cup of tea, in which the tea slows on a timescale equal to the 
geometrical mean of Dicke's global (viscous) diffusion time and the period of rotatiorQ. 



Interestingly, it was lEinstein (1926) who first used such an argument, in explaining 



the meanders of rivers, not realizing at the time that he was establishing ammunition to 
be used much later in defence of his General Theory of Relativity. 

The detailed knowledge we now have of 0(r, 6, t), coupled with our knowledge of 
the seismic hydrostatic stratification, enables us to calculate the shapes of the gravita- 
tional equipotentials more accurately than by any other means available today. They 
can be expressed in terms of the (eve n) coefficients J 2 k of a multipole e xpansion. The 
quadrupole moment, J 2 = 2.2 x 10~ 7 dSchou et al.lll998l : IXntia et al.ll2008h . contributes 
the most to the precession of the perihelion of the orbit of Mercury. That value induces 
an orbital precession which, with the current measurement precision, is too small to 
influence the precessional test of the theory of General Relativity. 

The baroclinicity induced by the (latitudinal) differential rotation of the convec- 
tion zone also drives meridional flow, which, unless strongly opposed by some agent, 
would transmit the latitudinal variatio n of O into the bulk o f the radiative interior on a 
timescale less than the age of the sun dSpiegel & Zahnlll992l) . Notwithstanding Spiegel 
and Zahn's hypothesis that appropriate anisotropy of turbulence induced by instability 
of that flow w ould confine the rotational shear to a tachocline (a process challenged by 
lEUiottl d!997l) ). Mclntyre and I (1998) have argued that no purely fluid-dynamical pro- 
cess can maintain the uniformity of O: in the radiative interior the only remaining pos- 
sibility is a presumably fossil, and therefore no doubt predominantly dipolar, magnetic 
field. Because magnetic diffusivity is so small, the (horizontal component of the) field 
is prevented from penetrating the tachocline by the downwelling tachocline flow, which 
occurs at all latitudes except those at which rotational shear is negligible (near latitudes 
±30°). I still believe this to be the case, despite the counterclaim by iBrun & Zahnl 
( 20061) . which they tried to support with (nec essarily excess i vely diffusive) numerical 



simulation. Garaud and her colleagues (e.g. iGaraudl 120021 : iGaraud & Garaud! 120081 : 



iGaraud & Acevedo Arreguirill200 9) have carried out a series of calc ulations with lower 
diffusion coefficients, but at the price of assuming axisymmetry; and 1 Wood et all d201ll) 



have gone a long way towards demonstrating the case. But there have been problems 
with preventing field peneuation of the tachocline near the poles, where the dipole field 
is vertical and where the tachocline circulation has al most no horizontal c ompo nent to 
sweep it aside, notwithstanding the demonstration by Wood & Mclntyre] (1201 lh of the 
existence of a steady state with the field confined to the radiative interior, even when 
it is symmetric about the rotation axis. I believe that that particular problem is due at 
least partly to the s uperficially sim plifying assumption of axisymmetry. As has been 
discussed elsewhere dGoughll2012l) . were the axis of the magnetic dipole initially not 



3 The analogy is not perfect, because, unlike a cup of tea, the sun is thermally stratified , and spin-d own is 
moderated via thermal diffusion, adding richness (i.e. complicating) the analysis (e.g. Dicke 1967). But 
the general principle remains; the basic dynamical processes are not completely prevented from operating. 
A recent quantitative investigation of the processes involved has been presented by Spiegel and Zahn 
(1992) in another context. 
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to have been aligned with that of the angular velocity in the tachocline, the tachocline 
circulation is likely to have applied a torque between the dipole and the convection 
zone in such a sense as to cause the dipole axis to migrate towards the latitudes of zero 
tachocline shear. Unfortunately the magnetic field appears to be too weak for that to be 
detected directly by seismology. 

In addition to the magnetic field impinging on the tachocline from beneath, there 
is also the possibility of the field being pumped or diffused into the tachocline from 
above. That field is likely to change sign with t he solar cycle, and hence decay in a 



distance much less than the tachocline thickness (lGaraudlll999h . even in the presence 



of the baroclinically drive n tachocline downwelling flow of the magnitude inferred by 



Goush & Mcl ntvre ( 1998). 



An important consequence of the tachocline circulation is that it mixes back into 
the convection zone helium and heavier elements which tend to settle under gravity. 
That process reduces the mean molecular mass fi of the material in the tachocline, and 
thereby increases the sound speed. That, I am sure, is the explanation of the tachocline 
sound-speed anomaly evident in Figures 1 and 2 as a hump between x = 0.6 and 0.7 in 
the sound-speed excess over that in Christensen-Dalsgaard's theoretical Model S. Note 
that it actually represents a smoothing of the sound speed. I maintain also that it is 
probably the meridional circulation that is the primary smoothing agent, notwithstand- 
ing the possibility of additional small-scale shear turbulence or convective overshoot. 
Think of the Gulf Stream, which transports, primarily by advection, heat from the Car- 
ribean to the coasts of north-western Europe. The associated turbulent transport is too 
weak to compete. Then notice that the Richardson number N 2 /(AQ.) 2 associated with 
the rotational shear in the tachocline (which is 10 12 times the Richardson number as- 
sociated with the tachocline circulation) - about 3 x 10 6 - is some hundred times more 
than that pertaining to the Gulf Stream. So the sun appears to be much more stable, and 
is unlikely to be subject to more intense hydrodynamically driven turbulence than is the 
Gulf Stream. However, there does remain the possibility of magnetorotational insta- 
bility, which is difficult to assess because the configuration of any weak magnetic field 
that might be present in the body of the tachocline (which is essentially nonexistent in 
the picture I have just painted) is not known. 

The physics of the remaining, large-scale, difference between the sound speed in 
the radiative zones of the sun and Model S has not been convincingly identified. There 
are a variety of possibilities, some of which I shall address below. The form of the 
sound-speed in the adiabatically stratified region of the convection zone of the sun is 
reasonably well established: c 2 {y\ - 1) GM(r~ l - where M is the mass of the 
sun out to the radius r (assumed constant for deriving this approximate relation) and 
71 - constant, so it appears that the discrepancy must result principally from adopting 
the wrong value of the seismic radius R. However, that does not explain the entire dis- 
crepancy; the complete resolution may be related to asphericity at the base of the con- 
vection zone, although I hasten to add that the base of the tachocline is almost certainly 
spherical, because the magnetic field is not strong enough to support any significant 
asphericity against the fi gradient in the radiative interior. 
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Returning to the angular velocity, I believe that o ther features of th e helioseismo- 
logical infere nces, such as the subp hotospheric shear (ISchou et al.lll998l). the torsional 
oscill ations ( Vorontsov et al.1 |2002|) and the tachocline oscillation ( Howe et"al] |2000L 
201 lh . which, with eye of faith, might also be discernible another half-wavelength or 



so deeper into the radiative zone, at least before the oscillation disappeared in 2000, do 
not have simple cogent explanations - with clarity sufficient for me to explain them to 
my grandchildren - although they are rightly (except, perhaps, the so-called tachocline 
oscillation) subjects of current research. However, the seismic inferences concerning Q. 
are providing an invaluable reference to guide the very fruitful numerical simulations 
of solar convection discussed by Toomre in these proceedings. 



4. Microscopic Physics 



The microscopic physics that might be accessible to seismological investigation con- 
cerns principally the adiabatic exponent y\ and the thermally pertinent quantities k and 
e: opacity and the rate of generation of heat by nuclear reactions. The second and 
third are evidently accessible only with the help of non-seismic information, because 
they involve temperature, which is a non-seismic quantity. Moreover, investigation of 
the first also requires non-seismic augmentation, because there is no redundancy in the 
equations governing adiabatic seismic oscillations. In all cases it is necessary to con- 
sider properties of theoretical stellar models, deriving from them constraints (always 
subject to the adoption of certain assumptions) that can be imposed upon seismological 
inferences. 



Adopting an equation of state that delivers y\{p,p;Xj), where Xi are the abun- 
dances of the chemical elements, enables one to estimate, at least in principle, those 
abundances seismologically, using the depression of y\ by ionization. Provided that 
the domain of investigation is deep enough in the convection zone where we believe 
the stratification to be adiabatic - itself a non-seismic constraint - the consequent re- 
lation between the variation of p and p restricts ambiguity in the magnitude and form 
of the depression, thereby permitting a calibration of X,. To date, only the abundance 
of helium has been reasonably reliably estimated (the hydrogen ionization zone lies in 
the superadiabatic boundary layer whose structure is uncertain because it depends on 
the treatment of convection, and therefore the depression of y\ cannot be measured) 
- the yi depression resulting from the ionization of individual heavier elements is too 
small to measure - although measurement of a combined depression is planned for es- 
timating the total heavy-element abundance by accept ing relative abundances o btained 
from spectroscopic studies of the solar atmosphere (Mussack & Gough 2009). This 
could b e fraught with uncertainty, becaus e recent modifications to the relative abun- 
dances (IGrevesse et al.ll201 ll : ICaffau et al. 2011), whose effect on opacity I mention at 
the end of this section, are still in some doubt. Additionally, an estimate based on the 
spatial mean value of a diagnostic t hermodynamic fun ction &(y\ ; r) which responds 
to ionization has been undertaken dAntia & Basull200"6h . but that procedure relies on 
the absolute value of © rather than its local devia tions, and is theref ore much more 
susceptible to uncertainties in the equation of state ( Baturin et al.ll2006T) : the reliability 
of the value of j\ obtained from modern equations of state has been discussed ex- 
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tensively ( Christensen- Dalsgaar d & Dapped 1 19921 : iDappen et aL 1990 j_ Dappen 19981 



Dap pen & Navfonqvll2000l) . and was questioned by lBasu & Ch ristensen-Dalsgaardfi 1997 ) 



and by iBasu et all (119991) . who attempted a seismological inversion for y\ , suggesting 
that the uncertainties are as great as the heavy-element induced depression itself. 

For the sake of the unwary reader, I draw attention to the fact that the question- 
ing has not completely been answered, because is was not possible to eliminate un- 
wanted integrals such as that in K on the right-hand side of Equation (2) to isolate 
Slnyi , an inevitable consequence of the lack of redundancy in the oscillation equations 
which I mentioned earlier, and which render it logically impossible to determine the 
intrinsic error in y\ by seism olo gical means alone. So me progress was made later by 
iRabello-Soares et all (|2000) and Di Ma uro et all (120021) . in which the functional form 
of the seismologically inaccessible error of j\ was estimated by implicitly using what 
I presume was assumed to be a more robust aspect of that same equation of state. It 
was concluded that quite good estimates of the uncertainty in j\ could be obtained in 
regions in which (<91nyi/(91nF)„ is small (here u - pip is the square of the isothermal 
sound speed), but not in the helium ionization zone where it is not (and where a reliable 
equation of state is needed for a sound determination of the helium abundance). There- 
fore there remains some uncertainty in direct seismological estimates of the helium 
abundance. 



I judge from the modern literature that the initial (ZAMS) helium abundance of 
the sun has been determined to be Yq = 0.25 ± 0.01, the 'errors' being estimates of 
accuracy, not precision. Interesting ly, this is the same as the value estimated from early 
seismic model-fitting (e.g. lGoughlll983al) . although in those early days the precision 
was only half as good as it is today, and the uncertainty was probably rather greater. 

I have spoken of the adiabatically stratified region of the convection zone as though 
its existence is obvious. But should that not be checked? To be sure, the interiors of 
the high-Rayleigh-number convection zones with which we are more familiar, such as 
occur in some laboratory experiments and in clouds in the Earth's atmosphere, appear 
to be adiabatically stratified, and modern numerical simulations such as those discussed 
by Toomre in these proceedings exhibit that property too. But should such arguments 
be trusted? As a pertinent aside I might remark that the original motivation for Davis's 
neutrino observatory was to confirm what most astrophysicists took for grant ed: that the 
sun re ally is powered by nuclear transmutation, even though the argument by Eddington 
(1926) seemed invincible, if not when it was first propounded. Much effort was sub- 
sequently expended in checking the details. Likewise, it should perhaps be considered 
a worthy endeavour to investigate convective stratification more thoroughly. The dif- 
ficulty in checking it seismologically is that its effect on the propagation of acoustic 
waves is only via the influence of buoyancy, which is tiny and therefore necessarily 
limits precision severely. So far as I am aware, there has b een only one attempt to test 
the stratification, yielding ly" 1 - T~ l \ - |V - V ad | < 0.03 (lGoughlll984l) . which, given 
that mixing-length theory predicts values of order 10 -6 in the lower half of the convec- 
tion zone, may not seem a very tight constraint. It would be interesting to repe at the 
exerci se with modern data using a direct inversion similar to that described by lElliott 
(119961) . 
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I come now to opacity. Early sound-speed inversions dChristensen-Dalsgaard et aL 

1985) suggested that opacity computations of the day were about 20 per cent too low 
immediately benea th the convection zone down to temperature s of about 4x 10 6 K. Sub- 
sequent scrutiny (Igles iaset al.lll990l : llglesias & Rogers!ll991 ) revealed an error in the 
treatment of spin-orbit coupling in the radiative-transition calculations that had been 
carried out at the Los Alamos National Laboratory, and some other, more technical, 
matters into which I shall not delve here. The error was found to affect k by even more 
at lower temperatures, not relevant to the sun because they occur in the convection zone. 
Correcting k resolved sev eral important issues in astrophys i cs, such as the excitation o f 
(3 Cephei and SPB stars dMoskalik & Dziembowskilll992l : iDziembowski et alj[l994h . 
With the new opacities, and other improvements such a s the incorporation of gravi 



tational settling against diffusion, the superb Model S of IChristensen-Dalsgaard et al. 



19961) was constructed; it has remained the most well-used reference model ever since. 



Opacit y became a hot topic agai n following the new spect roscopic abun dance 
analy ses bv lAsplund etail (I2005L 120091) . iGrevesse et all (1201 lh and lCaffau etaD(l2009L 



f ilih . in which three- dimensional hydr odyn amical solar atmosp heric models of lStein & Nordlund 
998b (see also iNordlund et al.ll2009h and ICaffau et all (120081) were used instead of 
one of the usual one-dimensional essentially hydrostatic models. The surprise was 
that the abundances of the opacity -producing elements C, N and O were first reported 
by Asplund and his colleagues to be about 30% lower than previously believed, al- 
though the values have risen somewhat since; those reported by Caffau and his col- 
leagues are somewhat h igher still, although low compared with the older values of 
Grevesse & SauvaJ dl998h Although the photosheric abundance of Ne, the remaining 



substantial contributor to opacity, cannot be measured accurately, it seemed likely that 
the effect of the convective fluctuations on the spectrum lines i nfluence the abundance 
analysis similarly (thereby laying doubt on the suggestion by iDrake & Testal (12005 ) 
that t he abundance of Ne is very much g reater than the value normally adopted - see 
also lYoungl d2005h : lAsplund et all (|2009)). This posed a problem. The effect of vary- 
ing opacity in solar models was already known to change the sound speed nontrivially; 
indeed, in an early series of papers Christensen-Dalsgaard demonstrated that 6c 2 / 6^, 
where £ is almost anything, is nonzero, and quite different for different parameters, or 
functions, £ - sufficiently different that it is unlikely that they are not linearly indepen- 
dent, and so adjusting other properties of the models could not plausibly be contrived 
to cancel the opacity discrepancy. Therefore Asplund's result destroyed the apparently 
superb correspondence of the seismic variables of Model S with reality. It was imme- 
diately obvious that something must be done to restore the opacity. 



The problem posed by Asplund within the framework of standard solar-evolution 
theory is easily understood. Consider first the known seismic structure. Then note that 
the variation of the surface luminosity L(t) is insensitive to assumptions about the inter- 
nal structure. Consequently Jl dt is well determined - given that we (think that we) are 
pretty sure of the sun's age - and so therefore is the total amount of hydrogen that has 
been consumed^. We can therefore safely take the absolute deficiency in the hydrogen 



4 To be sure, there are variations amongst models in the balance of the ppl and ppll chains which modify 
the total somewhat, but that is small compared with the enormous change in the value of the total heavy- 
element abundance Z that we are addressing: so too is the effect on the equation of state. 
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Figure 4. The continuous line is the relative difference between averages of the 
opacity k in the sun and the opacity in Model S of Christensen-Dalsgaard et al. 
(1996), inferred from optimally localized averages of sound-speed and density dif- 
ferences coupled with an estimate of the helium abundance Y and the assumption of 
thermal balance throughout, as described in the text. The dashed curve is the cor- 
responding difference along the inferred p — T — Y path in the sun, plotted against 
the centres o f the averaging kernels. The dotted curve is 0.01(51nA-/c?lnZ) p jx (from 
lGougrj2004h . 



abundance AX(r, t) to be the same as that in Model S, and hence obtain from p and p 
the temperature T in terms of the (presently unknown) initial hydrogen abundance Xq. 
From the outcome can be calculated the total rate of generation of energy by nuclear 
reactions, from which Xq can be determined by equating that rate with the observed lu- 
minosity. To be sure, the last step depends on accepting nuclear-reaction cross sections, 
but after decades of investigation by those in pursuit of a resolution to the solar neutrino 
problem I recommend that they be accepted, at least for the time being. One now has 
all the quantities present in the radiative transport equation, save the opacity k. Hence k 
can be evaluated. The difference between that and the opacity in Model S is plotted in 
Figure 4. The problem posed by Asplund is simply to reconcile that function with his 
surface abundance measurements, which seem to imply opacities that differ from the 
those in Model S by some 30 per cent or so. Of course one could instead have taken a 
different route by accepting the opacity and computing the nuclear energy generation 
rate e, but the outcome of that would obviously have been to find regions in the sun in 
which e < 0. Surely that also supports my recommendation not to do so. 



In attempts to shed light on the matter, many papers were written in which prop- 
erties of solar models were changed, some accompanied by un necessary inversions , 
reiterating what Christensen-Dalsgaard had taught us in the past; iBasu & Antial (120081) 
have compiled a useful catalogue. However, some were novel. If one sets aside the 
idea that the new abundance estimates are too low, despite the care that h as gone into 
deriving them, the paper that stands out from the rest in my mind is by Guzi k et al. 
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( 2005 ). who cut the cackle0by rejecting standard solar-evolution theory and posited 



that the chemical composition in the sun's radiative zone is essentially no different 
from that in Model S. How could that be? The hypothesis is that, after most of the 
sun had condensed from the interstellar medium, it accreted metal-deficient material, 
obscuring its true colours. Such accretion could hardly have occurred well into the 
main sequence, since the accreted material had to be gaseous, and would have been 
inhibited by the solar wind. But, granted that the proto-solar accretion disc was inho- 
mogeneous, with incipient planetary condensations into which solid grains were pref- 
erentially drawn, couldn't the largest of them have actually seeded the sun, onto which 
some normal, somewhat metal-deficient, gaseous disc material subsequently a ccreted? 
A somewhat modified story was entertained recently by iMelendez et al.l ( 2009b : accre- 



tion o nto the early sun of dust-cleansed proto-planetary nebular material, essentially 
what Guzik et al. (2005) had had in mind. In either case, the chemical inhomogeneity 



that remained in the sun would be Rayleigh-Taylor stable, and stable also to fingering, 
and would plausibly have created a discontinuity in composition, beneath the present- 
day base of the convection zone, that has survived until today. If so it would have 
seismological consequences: it would produce an oscillatory signature in the eigenfre- 
quencies not unlike that produced by the abrupt changes in stratification at the base 
of the convection zone that we model already. Its amplitude would be no greater than 
about 25% of that of the convection-zone signature, and the two would be entangled, 
rendering unambiguous detection difficult. Nevertheless, it is worth looking for. 



In a similar vein to my preferred statement of the abundance problem. IChristensen-Dalsgaard et al 



(2009) have recently asked the question: by how much would the opacity formula need 
to be changed were the chemical composition in the radiative interior to be (after ac- 
counting for gravitation se t tling, of course) changed to be consistent with the deter- 
mination bv lAsplund etaD (l2009h ? The outcome is an almost linear function of log T 
(along the thermodynamic p - T path of Model S), declining from about 30% at the 
base of the convection zone to about 6% at the centre. One might have thought that it 
should be simply - Yiiidx/dX^SXi, once again, computed along the p-T path of Model 
S, where 6Xj are the Asplund-Grevesse modifications. I tried approximating the opac- 
ity modification by (di</dZ)SZ assuming the relative abundances not to have changed, 
and was somewhat surprised to obtain a rather different result: my modifications were 
less close to being a linear function of log?", b eing some 5% greater than the values 



obtained by Christensen-Dalsg aard et all ( 20091) immediately beneath the convection 



zone, and about 2% less interior to r/R - 0.5. That caused me to wonder whether I had 
made a mistake in my simple calculation. However, during my presentation at the Fuji- 
hara seminar Christensen-Dalsgaard assured us that his result deviates from my simple 
estimate because in his calculations the relative abundances of the opacity-producing 
elements had also been modified. That hadn't been clear to me when I read the paper. 
Is this an example of where simple calculations go awry? Yes, if one takes the results 
too seriously without assessing the precision of what is being done, or without being 
absolutely sure of what one is comparing the outcome with. So here is another lesson 
to be learned. 



Jeffreys & Swirlesl dl956l) : lTrewiri ( [19671) 
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5. Seismic model-calibration 



Calibrating theoretical solar models against seismic data was the first means by which 
inferences about the inner state of the sun were drawn. I have already mentioned in 
the prelude to this contribution the calibration of the upper superadiabatic convective 
boundary layer, first from an analytically estimated perturbation, then by direct com- 
parison of the full oscillation frequencies of a set of numerically computed envelope 
models. That led to the first seismological revision of the depth of the convection zone, 
and then, using that result as a further calibrating datu m for full sola r models, to a seis- 
mic estimate of the initial helium abundance Yq (e.g. lGough|[l983bh . The location of 
the base of the convection zone has been used extensiv ely as a datum for assessing or 
calibrating solar models since (e.g.|Bahcalletal. 1998), although there are exceptions. 



Nowadays, more extensive, and often more highly processed, seismic data are used, 
including othe r aspects of entire seism ic models, to assess or calibrate evolved solar 
models (e.g. iTurck-Chieze et al.ll200#l 



The first overt global seism ic calibration of entire solar models was simply a naive 
least-squares frequency fitting (IChristensen-Dalsgaard & Gough| [l981). Interest was 



principally in the abundances of helium and heavy elements, and the consequent impli- 
cation concerning neutrino production, so only low-degree modes which penetrate into 
the radiative zone were used. Several local minima in^ 2 were found (depending on the 
values adopted for the orders n of the modes that were used, for they were not known at 
the time), the best two having helium abundances Y of a little above and substantially 
below 0.25 (a commonly favoured value of the day), and corresponding heavy-element 
abundances above and below 0.02. The helium-rich fit was somewhat better, although 
perhaps not significantly so. However, if one coupled those frequency fittings with 
the earlier seismological finding from high-degree modes that the convective zone is 
deeper than previously preferred, then the helium-rich solution was definitely favoured. 
That appeared to establish that the neutrino problem was an issue for nuclear or particle 
physics, not directly a problem for global heliophysics. Subsequent full inversions of 
the kind illustrated in Figures 1 and 2 were required to put the final nail in the coffin. 
What was significant was that the frequency residuals of the best-fitting model were 
substantially greater than the standard errors in the data. Thus it was evident that none 
of the models adequately represents the sun. That does not imply that the models are of 
no use to address specific scientific questions: rather than mindlessly trying to fit all the 
data together, one should try instead to extract from them signatures that are sensitive 
to the matter in question, and insensitive to extraneous properties. 

Calibrations come into their own when the matter in hand is beneath the resolution 
of straightforward inversio n. An importan t exa mple is the thickness o f the tachocline. 
As originally conceived by Spiegel (1 19721) and ISpiegel & Zahnl dl992h . the tachocline 



6 Turck-Chieze et al. called the seismically calibrated model a seismic model, notwithstanding its strong 
dependence on nonseismic argument. That is contrary to the usage of the term in this discussion, and to 
common usage in both helioseismology and geoseismology. I remark that RLS data-fitting could also be 
regarded as a (functional) calibration procedure, but because no non-seismic constraint is overtly imposed 
(other than the values of M and R - the regularizing penalty function is apparently arbitrary, although its 
definition is seismically motivated), it is reasonable in that case to call the outcome a seismic model. 
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is the transition region beneath the convection zone and the uniformly rotating radiative 
interior. It is the region in which material is homogenized with the convection zone, 
producing the sound-speed anomaly evident in Figures 1 and 2 (and the corresponding 
near-discontinuity in density). Therefore, its thic kness can be dete rmined by calibrat- 
ing the magnitude of the sound-speed anomaly dEUiott et al.| [l998). The procedure is 
precise, but the accuracy of the outcome depends crucially on the accuracy with which 
gravitational settling has been taken into account in the reference model, and, probably 
to a lesser extent, on th e structure of the mixed layer. Alterna tivel y, one can take the 



name literally, as have iKosoviche v ( 1996), Charbonneau ( 1998) and Charbonneau et al.1 



(Il999h . and try to measure the extent of the rotational shear by calibrating a plausible 
parametrized function (there is yet no reliable theoretical prediction of the functional 
form) against an inversion for angular velocity. 

A related calibration concerns the form of the shear itself. Straightforward inver- 
sions provide only a smooth variation with latitude. Yet if the magnetic field is dragged 
into the convect i on zo ne at mid latitudes by the upwelling tachocline circulation that 
Spiegel & Zahn ( 1992) and Mclntyre and I (1998) have described, shouldn't the conse- 



quent Maxwell stresses create a finite region of zero shear? Sekii and I have designed 
a putative seismic signature for detecting such a region, and we hope to put it to use 
when next we have adequate time together. 

I conclude my discussion of this topic by addressing what I regard as a major cali- 
bration. It is designed to determine how much hydrogen has been consumed by nuclear 
reactions throughout the lifetime of the sun in order to estimate the sun's age. The long- 
term goal of this continuing investigation, which we admit might be pie in the sky, is to 
ascertain whether, and if so by how long, the meteorites condensed after the formation 
of the sun. The principle of the calibration is to use a signature of low-degree p-mode 
frequencies that is sensitive particularly to the stratification of the core. The stratifica- 
tion evolves with time, in a m anner that depends on the proportion of hyd rogen - and 
therefore helium 



time, m a m anner mat depends on tne proportion oi n ydrogen - and 
in the core dChristensen-Dalsgaard[l988af : lGough| [l995). Therefore, 



it is necessary to ascertain the absolute helium abundance too. An early discussion by 
( 19991) ignored the abundance issue. I tried a two-parameter (?©, Yq) 



Dziembowski et al. 



calibration (IGoughll2001l) using for data two values of the so-called small frequency 
separation d n j := v n j - v„_i / + 2 averaged over different domains of (n, I). The small sep- 
aration is most sensitive to changes in the core; alt hough, formally, d„ i depe nds just as 
much on the stratification of the rest of the star (e.g. lGough & Novotnylll990h . the strat- 
ification outside the core hardly varies with Fo - or, equivalently, Zq - and fa, and it was 
hoped that the two different averages would weight fa and Yq sufficiently differently to 
enable them to be distinguished. That turned out to be only marginally possible, as be- 
co mes evident by comparing the i ntegrands for d„j a s fa a nd Zq are varied (illustrated 
by iGough & Novotnvl dl990f) and iHoudek & Goughl (1201 ll) . respectively). Therefore 



the calibration, which yielded t Q = 4.57Ga, and a rather high Zn, is uncomfortably 
susceptible to frequency and modelling errors. Soon afterwards, iBonanno et al.l (120021) 
repeated the more robust single-parameter calibration of Dziembows ki et al. f or fa b y 
simply adopting a plausible value of Zq, obtaining the same age as iGoughl fcOOlh . 
Subsequently Houdek and I (2011) developed a more robust two-pa rameter procedure, 
using the oscillatory signature of helium ionization (IGoughl Il990al : ?) in low-degree 
p modes to measure Y. Our result, which we hoped to be more reliable than what is 
obtained from a single-parameter fit, is fa = 4.60 ± 0.04 Ga and Zq = 0.0155 ± 0.0005, 
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with present-day surface abundances F s = 0.224, Z s = 0.0142. It is interesting that the 
value found for Z s is closer to m odern spectroscopic ally determined values - 0.0134 
(lAsplund et alJl2009h and 0.0153 (ICaffau et alJl201 lh - than the value 0.018 of Model 
S whose sound speed in the radiative envelope is more-or-less correct. 



Recently Doga n et alJ d2010) have refined the single-parameter calibration, obtain- 
ing t Q = 4.57 ± 0.08 Ga, with an imposed heavy-element abundance adjusted to yield 
Z s /X s = 0.0245, as in Model S. This age is essentially in agreement with Bonanno, 
Schattl and Paterno ' s earlier result. Meteoritic ages he between 4.563 and 4.576 Ga 
dAmelin et ai1l2002l : Ijacobsen et al.ll2008l . 120091: iBouvier & Wadhwall2010h . On con- 
sidering the the two-parameter calibration of iHoudek & Gough (1201 lh . Christensen- 
Dalsgaard privately complained that the latter leads to a model that is seismically unac- 
ceptable (as, of course, it must be, because the opacity is too low, and therefore so too 
is the sound speed in the radiative envelope outside the core). That remark begs the in- 
teresting question, to which I have already alluded, of whether or not with an imperfect 
model it is better to satisfy conditions in the envelope when trying to assess conditions 
in the core. This is an important issue of principle for model calibration - I invite you 
to ignore the fact that in this particular example the final answers agree to within their 
quoted precision, for that is beside the point - to which I now turn my attention. 



6. The effect of hidden parameters on model-fitting 



With the solar age calibration in mind, I consider a class of solar models depending 
explicitly on two parameters, £ and which could be t Q and Zq. Bearing in mind that 
none of those models is seismically consistent with the sun, I imagine there to be a 
broader, virtual, class, extended in some (unknown to me) way so as to encompass the 
sun, and represent the distance from the sun of any of my explicit models to be char- 
acterized by the (hidden) parameter rj, whose value is unknowable to the calibration. I 
normalize the parameters in such a way that , tj, f)= (0, 0, 0) represents the real sun, 
and I presume that the largest values of (£, rj, £) that I need to consider are small enough 
to permit linearization of the model differences from the sun. I point out that in real life 
£ rj, £ are likely to be functions (or simply vectors, if those functions are considered to 
have been expanded in terms of a basis set). 

I now consider making a series of different observations O,, i = 1, 2, subject to 
random unknown errors which for simplicity I take to be independent. The values of 
Oj are functional of the structure. They too are normalized such that their exact values 
for the sun are zero. They can therefore be represented as a linear combination of the 
model-specifying parameters thus: 

O i = a£+fl i n + y i £-ei = 0. (4) 

The intention is to calibrate the models with these data, the principal interest being in 
the value of 

The coefficients a, and y,- are, of course, known functionals of the models, but we 
have no idea of the relation between /3, and the models, nor even the measurements. 
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Nevertheless, I assume for simplicity that the parameters (a;,A'»y«) axQ scaled such 
that the data errors e* have the same variance cr 2 . Then it is reasonable to attempt a 
calibration by minimizing 

1=1 

The results will be found to depend on the known coefficients 

i i i 

and the error combinations 

i i 

they depend also on the unknown coefficients 

D = Yafii, E = YftYi. (8) 

i i 

6.1. Full calibration 

The full calibration is the result of minimizing £ with respect to both £ and r\\ 

£ = & := A" 1 [(BE - CD)j] + Be h - Ce a ] ; A = AC - B 2 . (9) 
It has expectation 

| f = A-\BE-CDyn (10) 

and error-variance 

e\ = A-'Ccr 2 =: cr* . (11) 

The corresponding expressions for £ are similar. Note that the error variance is a 
measure of the precision of the calibration procedure; it is not an absolute indicator of 
the accuracy of the outcome. 

6.2. Partial calibration 

Minimizing £ with respect to £ for an assumed value of £ yields 

t = f p :=-A-\B( + Dri + ea), (12) 

yielding 

^ = -A- 1 (^ + ZJi/), ^ = A-W=:a^. (13) 

It is perhaps pertinent to remark that as the number / of measurements is increased, 
the error-variance cr 2 decreases, because A increases: adding new information, taken 
properly into account, never decreases precision. That is true also when the errors are 
correlated. 
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6.3. Comparison of the calibrations 

We first notice that had the models encompassed the sun the parameter 77 would have 
been redundant. It could have been set to zero, which is equivalent to having ignored 
the possibility of its existence. Then £f - 0, the correct answer; but £ p = -A~ l B£, 
which is unlikely to vanish unless the correct value of namely zero, were adopted. 
However, that is not the situation for most calibrations, entire solar models being an 
archetypical example. 

In the pertinent case r\ is not zero, and neither £f nor £ p is likely to vanish. More- 
over, because each is susceptible to the unknown 8, in different ways, one cannot com- 
pare the offsets in a reliable way, even when f = in the partial calibration. However, 
one can readily show, from Schwarz's inequality, that A > 0, and hence 

fff-<r* = A- l B 2 A- 1 >0; (14) 

reducing the parameter space of the calibration necessarily increases the precision 
(whether or not the measurement errors are independent). But does it improve the 
accuracy, even when the correct choice of the fixed parameters is made? 

The 77-dependent offsets given by equations (fTOl t and ( [TBI cannot be compared be- 
cause we have no idea of the values of the coefficients D and E. We don't even know 
physically what they mean, for they represent what we don't know. But as an exercise 
with possibly very little meaning - indeed it is certainly not easy to interpret the results 
- 1 have conducted statistical computations in which the coefficients (a^yS^y,) and if) 
were varied randomly (initially each uniformly and independently distributed between 
-1 and +1) to simulate random situations under calibration, in each case drawing a 
sufficient number of examples to obviate serious stochasticity in the outcome: 100 av- 
erages of 1000 examples with differing r\ of using / = 1000 different measurements 
defined by (a,,/?,, y,) in a calibration. In the first case I assumed ignorance of which 
in the partial calibration I drew from a set of random numbers disuibuted identically to 

the other parameters. Both calibrations yielded small average values and ^ - they 
should have, because Bjij were distributed symmetrically about zero - although in 70 

per cent of the cases £jr was smaller. When instead £ was set to zero - the correct value - 

in about 70 per cent of the partial calibrations & was smaller. However, when £ is set to 

an incorrect value, significantly different from zero, then f| was almost always smaller. 
Qualitatively similar results were obtained when (a,,/?,, y,) and 77 were distributed nor- 
mally. I accept that perusal of equations (fT0l-([T3l shows that the outcome must depend 
on the details of the distribution functions chosen for the parameters (a,-,/3,-, y ( ; 77) that 
define the situations being calibrated. Nonetheless, if one simply accepts the two sim- 
ple distributions that I have adopted, then taking all these results together suggests the 
following rule of thumb: 

Accuracy declines with increasing precision. 

It is extremely important to appreciate that this rule is merely a guide. It is not an 
inviolable law. Indeed, there are many obvious counterexamples. But it may serve the 



23 



useful purpose of inducing one to investigate more carefully the analysis that one has 
in hand. 

The rule serves the sobering purpose of suggesting that a partial calibration is 
likely to be less reliable than one incorporating a broader space of possibilities, even 
though the former is more precise. It is not uncommon, both in our discipline and 
elsewhere in science, for a careful extensively researched calibration to be repeated by a 
more restricted one, necessarily with higher precision, and the latter subsequently being 
used by others in preference in the possibly mistaken belief that it is more accurate. 

I conclude with a few more obvious remarks in this vein. First it goes without say- 
ing that precision improves with better data. And accuracy does not decline - indeed, 
it must improve too. But better data are not required for the direct purpose in hand if 
uncertainty in the calibration is dominated by rj. Of course, it is not always evident how 
such a situation might be detected, let alone assessed. Furthermore, better data might 
also be useful indirectly, when using different procedures, for assessing the deleterious 
impact of rj. There are also obvious classes of procedures that can increase accuracy 
by decreasing parameter space, perhaps even more than the amount by which precision 
is increased. An obvious example is the application of constraints on the manner in 
which the data O, are combined in the calibration to eliminate contamination by known 
aspects of the model whose actual influence on Oj is not known, analogous to the elim- 
ination of near surface effects in structure inversions via the function P/I in equations 
© and ©. 



7. Summary 

Simple models have been useful for providing an initial rough understanding of the 
structure of the sun and the manner in which the properties of helioseismic oscillations 
can be used for structural and kinematical diagnosis. The analyses can be of toy models 
or asymptotic approximations to the real sun. The analytical formula so obtained have 
played an extremely important role in designing diagnostic procedures. It has been 
argued that numerical surveys can be just as good, and that the simple analytical proce- 
dures are therefore unnecessary. I do not deny the sentiment behind that opinion. But 
I do deny the reality. To be sure, in the right hands numerical surveys have contributed 
greatly to our knowledge: partly directly, and partly by providing benchmarks against 
which the analytical formulae can be tested and sometimes calibrated. But it is also 
important to realize that in practice it has been principally the approximate analytical 
formulae that have motivated the design of the seismic diagnostics in use today; those 
formulae may not have been necessary, but they have surely accelerated the develop- 
ment of our ability to advance understanding. To quote a few examples, it was simple 
models that led to the first seismic calibration of the depth of the solar convection zone, 
and the determination of the helium abundance, the interior rotation and some aspects 
of the structure of the core. It has also been useful to adopt hybrid procedures, using, 
for example, simple, sometimes asymptotic, methods to estimate deviations of numer- 
ically computed models from the real sun. Initial investigations of these matters have 
been followed up by other procedures, necessary not only to improve precision but, 
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more importantly, to view the situation differently in order to detect whether particu- 
lar individual procedures are biassed by hidden agents. By so doing, the reliability of 
the inferences is increased, sometimes causing estimates of accuracy to be moderated. 
This point is illustrated by contrasting the first direct seismological determination of the 
depth of the convection zone, using a small suite of methods whose differences deter- 
mined the accuracy, with subsequent single-procedure determinations with necessarily 
greater precision to which have too hastily been attributed concomitant accuracy. The 
simple toy model discussed §6 exemplifies that point. 

The broad message that I am trying to put forward is that helioseismology is not 
dead. Although for most astronomers asteroseismology offers a wider arena of discov- 
ery, for the physicist there is still much to investigate in the sun, possibly more than in 
other stars, at least in the short term when there is yet too much that is unknown about 
other stars to isolate issues in physics from uncertainties in stellar structure. I admit 
there have been counterexamples. Therefore, as I have illustrated in this discussion, 
there remains a substantial amount of work that theorists must attend to. There is also 
more work for data analysts too, who, sadly, are few. I repeat my plea for more atten- 
tion to estimating error correlations, for they c an influence inferences significantly (e.g. 
Howe & Thompson! 1 9961 : iGough & Sekd2002h . possibly even to the extent of Massing 
results by an amoun t that is muc h greater than the apparent variance of the propagated 
random uncertainty (Gough ll996h . The community investigating such matters is small. 
But the importance of the endeavour is not. 
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